中图分类
执行
    中文(共146篇) 外文(共9739篇)
    排序:
    导出 保存至文件
    [期刊]   Mohamed Elasri   Omar Elharrouss   Somaya Al-Maadeed   Hamid Tairi   《Neural processing letters》    2022年54卷5期      共38页
    摘要 : Abstract The creation of an image from another and from different types of data including text, scene graph, and object layout, is one of the very challenging tasks in computer vision. In addition, capturing images from different ... 展开

    [期刊]   Zaike Li   Li Liu   Huaxiang Zhang   Dongmei Liu   Yu Song   Boqun Li   《Multimedia Systems》    2024年30卷1期      共13页
    摘要 : Since locally controllable text-to-image generation cannot achieve satisfactory results in detail, a novel locally controllable text-to-image generation network based on visual-linguistic relation alignment is proposed. The goal o... 展开

    [期刊]   Zelaszczyk M.   Mandziuk J.   《Information Fusion》    2023年93卷      共28页
    摘要 : We review the existing literature on generating text from visual data under the cross-modal generation umbrella, which affords us to compare and contrast various approaches taking visual data as input and producing text outputs, w... 展开

    [期刊]   Qi, Zhongjian   Fan, Chaogang   Xu, Liangfeng   Li, Xinke   Zhan, Shu   《Pattern recognition letters》    2021年147卷Jul.期      共7页
    摘要 : Synthesizing photographic images from given text descriptions is a challenging problem. Although current methods first synthesize an initial blurred image, then refine the initial image to a high-quality one, the most existing met... 展开

    摘要 : With the advent of generative adversarial networks, synthesizing images from text descriptions has recently become an active research area. It is a flexible and intuitive way for conditional image generation with significant progr... 展开

    [期刊]   Hyeeun Ku   Minhyeok Lee   《Applied Sciences》    2023年13卷8期      共12页
    摘要 : Generative adversarial networks (GANs) have demonstrated remarkable potential in the realm of text-to-image synthesis. Nevertheless, conventional GANs employing conditional latent space interpolation and manifold interpolation (GA... 展开

    摘要 : Altering the content of an image with photo editing tools is a tedious task for an inexperienced user, especially, when modifying the visual attributes of a specific object in an image without affecting other constituents such as ... 展开

    摘要 : Composing Text and Image to Image Retrieval (CTI-IR) is an emerging task in computer vision, which allows retrieving images relevant to a query image with text describing desired modifications to the query image. Most conventional... 展开

    [期刊]   Dong, Chenhe   Li, Yinghui   Gong, Haifan   Chen, Miaoxin   Li, Junxin   Shen, Ying   Yang, Min   《ACM Computing Surveys》    2023年55卷8期      共38页
    摘要 : This article offers a comprehensive review of the research on Natural Language Generation (NLG) over the past two decades, especially in relation to data-to-text generation and text-to-text generation deep learning methods, as wel... 展开

    摘要 : Text-to-image generation aims to generate images from text descriptions. Its main challenge lies in two aspects: (1) Semantic consistency, i.e., the generated images should be semantically consistent with the input text; and (2) V... 展开

    研究趋势
    相关热图
    学科分类